Search Results/Filters    

Filters

Year

Banks




Expert Group











Full-Text


Issue Info: 
  • Year: 

    2015
  • Volume: 

    -
  • Issue: 

    4 (SERIAL 26)
  • Pages: 

    117-125
Measures: 
  • Citations: 

    0
  • Views: 

    1782
  • Downloads: 

    0
Abstract: 

Word sense disambiguation is the task of identifying the correct sense for the Word in a given context among a finite set of possible senses, and plays an important role in many natural language processing applications such as machine translation, document classification, and information retrieval.In this paper we proposed to use LDA Topic Model for disambiguating Farsi homograph words with new features. A Topic Model is a statistical Model for extract Topics from documents of a corpus. We use Latent Dirichlet Allocation (LDA) that is an unsupervised technique.The system achieved 97% precision for 4 high frequently Farsi homograph words.

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 1782

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 0
Journal: 

Scientia Iranica

Issue Info: 
  • Year: 

    2021
  • Volume: 

    28
  • Issue: 

    3 (Transactions E: Industrial Engineering)
  • Pages: 

    1830-1852
Measures: 
  • Citations: 

    0
  • Views: 

    89
  • Downloads: 

    98
Abstract: 

Information Technology (IT), management and industrial engineering are correlated academic disciplines whose publications have risen signi ficantly over the last decades. The aim of this study is to analyze the research evolution, determine the important Topics and areas, and depict the trend of interdisciplinary Topics in these domains. To accomplish this, text mining techniques are used and a combination of bibliographic analysis and a Topic Modeling approach are applied to relevant publications in the Web of Science (WoS) repository over the last 20 years. In the Topic extraction process, a heuristic function was suggested for key extraction, and some new applicable criteria were defi ned to compare the Topics. Moreover, a novel approach was proposed to determine the high-level category for each Topic. The results determined the hot-important Topics, and incremented, decremented and fixed Topics are identi fied. Subsequently, a comparison between high-level research areas con firmed strong scienti fic relationships between them. This study presents a deep knowledge about the internal research evolution of domains and illustrates the effect of Topics on each other over the past 20 years. Furthermore, the methodology of this study could be applied to determine interdisciplinary Topics and observe the research evolution of other academic domains.

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 89

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 98 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 0
Issue Info: 
  • Year: 

    2019
  • Volume: 

    15
  • Issue: 

    4 (38)
  • Pages: 

    57-70
Measures: 
  • Citations: 

    0
  • Views: 

    591
  • Downloads: 

    0
Abstract: 

A probabilistic Topic Model assumes that documents are generated through a process involving Topics and then tries to reverse this process, given the documents and extract Topics. A Topic is usually assumed to be a distribution over words. LDA is one of the first and most popular Topic Models introduced so far. In the document generation process assumed by LDA, each document is a distribution over Topics and each word in the document is sampled from a chosen Topic of that distribution. It assumes that a document is a bag of words and ignores the order of the words. Probabilistic Topic Models such as LDA which extract the Topics based on documents-level word co-occurrences are not equipped to benefit from local word relationships. This problem is addressed by combining Topics and n-grams, in Models like Bigram Topic Model (BTM). BTM modifies the document generation process slightly by assuming that there are several different distributions of words for each Topic, each of which correspond to a vocabulary word. Each word in a document is sampled from one of the distributions of its selected Topic. The distribution is determined by its previous word. So BTM relies on exact word orders to extract local word relationships and thus is challenged by sparseness. Another way to solve the problem is to break each document into smaller parts for example paragraphs and use LDA on these parts to extract more local word relationships in these small parts. Again, we will be faced with sparseness and it is well-known that LDA does not work well on small documents. In this paper, a new probabilistic Topic Model is introduced which assumes a document is a set of overlapping windows but does not break the document into those parts and assumes the whole document as a single distribution over Topics. Each window corresponds to a fixed number of words in the document. In the assumed generation process, we walk through windows and decide on the Topic of their corresponding words. Topics are extracted based on words co-occurrences in the overlapping windows and the overlapping windows affect the process of document generation because; the Topic of a word is considered in all the other windows overlapping on the word. On the other words, the proposed Model encodes local word relationships without relying on exact word order or breaking the document into smaller parts. The Model, however, takes the word order into account implicitly by assuming the windows are overlapped. The Topics are still considered as distributions over words. The proposed Model is evaluated based on its ability to extract coherent Topics and its clustering performance on the 20 newsgroups dataset. The results show that the proposed Model extracts more coherent Topics and outperforms LDA and BTM in the application of document clustering.

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 591

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 0
Issue Info: 
  • Year: 

    2020
  • Volume: 

    18
  • Issue: 

    3
  • Pages: 

    231-240
Measures: 
  • Citations: 

    0
  • Views: 

    394
  • Downloads: 

    0
Abstract: 

Research in the field of video surveillance systems has been improving because of the increasing need for intelligent monitoring, control and management. Given the large amount of data on these intelligent transportation systems, extracting patterns and automatically labeling them is a challenging task. In this paper, a Topic Model was used to detect and extract traffic patterns at intersections so that visual patterns are transformed into visual words. The input video is first split into clips. Then, the flow characteristics of the clips, which are based on abundant local motion vector information, are computed using optical flow algorithms and converted to visual words. After that, with a non-probabilistic Topic Model, the traffic patterns are extracted to the designed system by a group sparse Topical coding method. These patterns represent visible motion that can be used to describe a scene by answering a behavioral question such as: Where does a vehicle go? The results of the implementation of the proposed method on the QMUL video database show that the proposed method can correctly detect and display meaningful traffic patterns such as turn left, turn right and crossing a roundabout.

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 394

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 0
Issue Info: 
  • Year: 

    2017
  • Volume: 

    9
  • Issue: 

    2
  • Pages: 

    19-25
Measures: 
  • Citations: 

    0
  • Views: 

    152
  • Downloads: 

    67
Abstract: 

Probabilistic Topic Models have been very popular in automatic text analysis since their introduction. These Models work based on word co-occurrence, but are not very flexible with respect to the context in which cooccurrence is considered. Many probabilistic Topic Models do not allow for taking local or spatial data into account. In this paper, we introduce a probabilistic Topic Model that benefits from an arbitrary-length co-occurrence window and encodes local word dependencies for extracting Topics. We assume a multinomial distribution with Dirichlet prior over the window positions to let the words in every position have a chance to influence Topic assignments. In the proposed Model, Topics being shown by word pairs have a more meaningful presentation. The Model is applied on a dataset of 2000 documents. The proposed Model produces interesting meaningful Topics and reduces the problem of sparseness.

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 152

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 67 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 0
Issue Info: 
  • Year: 

    2020
  • Volume: 

    35
  • Issue: 

    2
  • Pages: 

    553-574
Measures: 
  • Citations: 

    0
  • Views: 

    644
  • Downloads: 

    0
Abstract: 

With the proliferation of the Internet and the rapid growth of electronic articles, text classification has become one of the key and important tools for data organization and management. In text classification a set of basic knowledge is provided to the system by learning. Then, new input documents enter to one of the subject groups. In health literature due to wide variety of Topics, preparing such a set of early education is a very time consuming and costly task. The purpose of this article is to present a hybrid Model of learning (supervised and unsupervised) for the subject classification of health scientific products that performs the classification operation without the need for an initial labeled set. To extract the thematic Model of health science texts from 2009 to 2019 at PubMed database, data mining and text mining were performed using machine learning. Based on Latent Dirichlet Allocation Model, the data were analyzed and then the Support Vector Machine was used to classify the texts. In the findings of this study, the Model was introduced in three main steps. In data preprocessing, the unnecessary words were eliminated from the data set and the accuracy of the proposed Model increased. In the second step, the themes in the texts were extracted using the Latent Dirichlet Allocation method, and as a basic training set in step 3, the data were backed up by the Support Vector Machine algorithm and the classifier learning was performed with the help of these Topics. Finally, with the help of the classification, the subject of each document was identified. The results showed that the proposed Model can build a better classification by combining unsupervised clustering properties and prior knowledge of the samples. Clustering on labeled samples with a specific similarity criterion merges related texts with prior knowledge, and the learning algorithm teaches classification by supervisory method. Combining classification and clustering can increase the accuracy of classification of health texts.

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 644

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 0
Writer: 

مهدی-جلالی

Issue Info: 
  • End Date: 

    مهر 1384
Measures: 
  • Citations: 

    0
  • Views: 

    247
  • Downloads: 

    0
Keywords: 
Abstract: 

قطعه فوق یک قطعه استراتژیک در صنعت حفاری است که دانش فنی آن را جهاد تهیه کرده است. دانش فنی این قطعه شامل مشخصات مکانیکی و متالورژیکی، نقشه فنی و نقشه بازرسی و همچنین اسکوپ بازرسی است.

Yearly Impact:   مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 247

Issue Info: 
  • Year: 

    2022
  • Volume: 

    52
  • Issue: 

    4
  • Pages: 

    281-291
Measures: 
  • Citations: 

    0
  • Views: 

    155
  • Downloads: 

    18
Abstract: 

Automatic Topic detection seems unavoidable in social media analysis due to big text data which their users generate. Clustering-based methods are one of the most important and up-to-date categories in Topic detection. The goal of this research is to have a wide study on this category. Therefore, this paper aims to study the main components of clustering-based-Topic-detection, which are embedding methods, distance metrics, and clustering algorithms. Transfer learning and consequently pretrained language Models and word embeddings have been considered in recent years. Regarding the importance of embedding methods, the efficiency of five new embedding methods, from earlier to recent ones, are compared in this paper. To conduct our study, two commonly used distance metrics, in addition to five important clustering algorithms in the field of Topic detection, are implemented by the authors. As COVID-19 has turned into a hot trending Topic on social networks in recent years, a dataset including one-month tweets collected with COVID-19-related hashtags is used for this study. More than 7500 experiments are performed to determine tunable parameters. Then all combinations of embedding methods, distance metrics and clustering algorithms (50 combinations) are evaluated using Silhouette metric. Results show that T5 strongly outperforms other embedding methods, cosine distance is weakly better than other distance metrics, and DBSCAN is superior to other clustering algorithms.

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 155

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 18 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 0
Journal: 

LITERARY CRITICISM

Issue Info: 
  • Year: 

    2019
  • Volume: 

    12
  • Issue: 

    46
  • Pages: 

    49-68
Measures: 
  • Citations: 

    0
  • Views: 

    500
  • Downloads: 

    0
Abstract: 

Metalepsis, in its narratological sense, is a trope in which an unnatural relationship is built between different levels of narrative. The natural relationship between narrative levels is formed by the act of narrating; a character from one level becomes the narrator of another. The term “ Metalepsis of Topic and illustration” can be coined to name a similar trope. This trope has been used for centuries in Persian poetry. Every Image comprises a Topic and an illustration. The Topic is what is being talked about and the illustration is what the Topic is compared to. When several images are present along together, two different levels are distinguishable: the level of the Topic and the level of illustration. The natural relationship between these two levels is similarity and any other relationship will result in metalepsis. As in the narratological metalepsis, there is always a paradox in the metalepsis of Topic and illustration. The effect of these two kinds of metalepsis is also similar and can be humorous, fantastic, or a mixture of the two.

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 500

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 0
Author(s): 

Issue Info: 
  • Year: 

    2020
  • Volume: 

    8
  • Issue: 

    -
  • Pages: 

    439-453
Measures: 
  • Citations: 

    1
  • Views: 

    76
  • Downloads: 

    0
Keywords: 
Abstract: 

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 76

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 1 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 0
litScript
telegram sharing button
whatsapp sharing button
linkedin sharing button
twitter sharing button
email sharing button
email sharing button
email sharing button
sharethis sharing button